Robust Argumentative Zoning for Sensemaking in Scholarly Documents
نویسندگان
چکیده
We present an automated approach to classify sentences of scholarly work with respect to their rhetorical function. While previous work that achieves this task of argumentative zoning requires richly annotated input, our approach is robust to noise and can process raw text. Even in cases where the input has noise (as is obtained from optical character recognition or text extraction from PDF files), our robust classifier is largely accurate. We perform an in-depth study of our system both with clean and noisy inputs. We also give preliminary results from in situ acceptability testing when the classifier is embedded within a digital library reading environment.
منابع مشابه
CoZo+ - A Content Zoning Engine for textual documents
Content zoning can be understood as a segmentation of textual documents into zones. This is inspired by [6] who initially proposed an approach for the argumentative zoning of textual documents. With the prototypical Cozo+ engine, we focus on content zoning towards an automatic processing of textual streams while considering only the actors as the zones. We gain information that can be used to r...
متن کاملA Weakly-supervised Approach to Argumentative Zoning of Scientific Documents
Argumentative Zoning (AZ) – analysis of the argumentative structure of a scientific paper – has proved useful for a number of information access tasks. Current approaches to AZ rely on supervised machine learning (ML). Requiring large amounts of annotated data, these approaches are expensive to develop and port to different domains and tasks. A potential solution to this problem is to use weakl...
متن کاملAccurate Argumentative Zoning with Maximum Entropy models
We present a maximum entropy classifier that significantly improves the accuracy of Argumentative Zoning in scientific literature. We examine the features used to achieve this result and experiment with Argumentative Zoning as a sequence tagging task, decoded with Viterbi using up to four previous classification decisions. The result is a 23% F-score increase on the Computational Linguistics co...
متن کاملAutomatic Critiquing of Novices’ Scientific Writing Using Argumentative Zoning
Scientific writing can be hard for novice writers, even in their own language. We present a system that applies Argumentative Zoning (AZ) (Teufel & Moens 2002), a method of determining argumentative structure in texts, to the task of advising novice writers on their writing. We address this task by automatically determining the rhetorical/argumentative status and the implicit author stance of a...
متن کاملSensemaking tools for understanding research literatures: Design, implementation and user evaluation
This paper describes the work undertaken in the Scholarly Ontologies Project. The aim of the project has been to develop a computational approach to support scholarly sensemaking, through interpretation and argumentation, enabling researchers to make claims: to describe and debate their view of a document’s key contributions and relationships to the literature. The project has investigated the ...
متن کامل